Advanced Data Mining Techniques

نویسندگان

  • David L. Olson
  • Dursun Delen
چکیده

The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. I dedicate this book to my grandchildren. Preface The intent of this book is to describe some recent data mining tools that have proven effective in dealing with data sets which often involve uncertain description or other complexities that cause difficulty for the conventional approaches of logistic regression, neural network models, and decision trees. Among these traditional algorithms, neural network models often have a relative advantage when data is complex. We will discuss methods with simple examples, review applications, and evaluate relative advantages of several contemporary methods. Our intent is to cover the fundamental concepts of data mining, to demonstrate the potential of gathering large sets of data, and analyzing these data sets to gain useful business understanding. We have organized the material into three parts. Part I introduces concepts. Part II contains chapters on a number of different techniques often used in data mining. Part III focuses on business applications of data mining. Not all of these chapters need to be covered, and their sequence could be varied at instructor design. The book will include short vignettes of how specific concepts have been applied in real practice. A series of representative data sets will be generated to demonstrate specific methods and concepts. References to data mining software and sites such as www.kdnuggets.com will be provided. Chapter 1 gives an overview of data mining, and provides a description of the data mining process. An overview of useful business applications is provided. Chapter 2 presents the data mining process in more detail. It demonstrates this process with a typical set of data. Visualization of data through data mining software is addressed. Chapter 3 presents memory-based reasoning methods of data mining. Major real applications are described. Algorithms are demonstrated with prototypical data based on real applications. Chapter 4 discusses association rule methods. Application in the form of market basket analysis is discussed. A real data set is described, and a simplified version used to demonstrate association rule methods. Chapter 5 presents fuzzy data mining approaches. Fuzzy decision tree approaches are described, as well as fuzzy association rule applications. Real data mining applications are described and demonstrated Chapter 6 presents Rough …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting the Next State of Traffic by Data Mining Classification Techniques

Traffic prediction systems can play an essential role in intelligent transportation systems (ITS). Prediction and patterns comprehensibility of traffic characteristic parameters such as average speed, flow, and travel time could be beneficiary both in advanced traveler information systems (ATIS) and in ITS traffic control systems. However, due to their complex nonlinear patterns, these systems ...

متن کامل

Application of Rough Set Theory in Data Mining for Decision Support Systems (DSSs)

Decision support systems (DSSs) are prevalent information systems for decision making in many competitive business environments. In a DSS, decision making process is intimately related to some factors which determine the quality of information systems and their related products. Traditional approaches to data analysis usually cannot be implemented in sophisticated Companies, where managers ne...

متن کامل

Data Mining Techniques for Mortality at Advanced Age

This paper addresses issues and techniques for advanced age mortality study using data mining techniques, a new technology on the horizon with great actuarial potential. Data mining is an interactive information discovery process that includes data acquisition, data integration, data exploration, model building, and model validation. Both expert opinion and information discovery techniques are ...

متن کامل

Knowledge Discovery in the Form of Prototypical Cases Using Advanced Data Mining Techniques

Knowledge discovery in the form of prototypical cases using advanced data mining techniques Rathnavel Rajagopal Chair of the Supervisory Committee: Asst. Professor Isabelle Bichindaritz Computing and software systems This thesis applies advanced data mining techniques to obtain structured and abstract knowledge structures (or prototypical cases) from clinical data available in a certain medical...

متن کامل

Comparing ordinary kriging and advanced inverse distance squared methods based on estimating coal deposits; case study: East-Parvadeh deposit, central Iran

Finding a proper estimation method for ore resources/reserves is important in mining engineering. The aim of this work is to compare the Ordinary Kriging (OK) and Advanced Inverse Distance Squared (AIDS) methods based on the correlation between the raw and estimated data in the East-Parvadeh coal deposit, central Iran. The variograms and anisotropic ellipsoids are calculated to estimate the ash...

متن کامل

Data mining techniques for improving the reliability of system identification

A system identification methodology that makes use of data mining techniques to improve the reliability of identification is presented in this paper. An important aspect of the methodology is the generation of a population of candidate models. Indications of the reliability of system identification are obtained through an examination of the characteristics of the population. Data mining techniq...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008